Interpretation des predictions maisons
Author : VotreNom
Description : Rapport Shapash pour maisons
Project_Name : Analyse randomforest_maison
Model used : RandomForestRegressor
Library : sklearn.ensemble._forest
Library version : 1.5.2
Model parameters :
| Parameter key | Parameter value |
|---|---|
| estimator | DecisionTreeRegressor() |
| n_estimators | 174 |
| estimator_params | ('criterion', 'max_depth', 'min_samples_split', 'min_samples_leaf', 'min_weight_fraction_leaf', 'max_features', 'max_leaf_nodes', 'min_impurity_decrease', 'random_state', 'ccp_alpha', 'monotonic_cst') |
| bootstrap | True |
| oob_score | False |
| n_jobs | None |
| random_state | None |
| verbose | 0 |
| warm_start | False |
| class_weight | None |
| max_samples | None |
| criterion | squared_error |
| max_depth | 16 |
| min_samples_split | 2 |
| Parameter key | Parameter value |
|---|---|
| min_samples_leaf | 2 |
| min_weight_fraction_leaf | 0.0 |
| max_features | 1.0 |
| max_leaf_nodes | None |
| min_impurity_decrease | 0.0 |
| ccp_alpha | 0.0 |
| monotonic_cst | None |
| feature_names_in_ | ['etage' 'surface' 'surface_terrain' 'nb_pieces' 'balcon' 'eau' 'bain' 'dpeL' 'dpeC' 'mapCoordonneesLatitude' 'mapCoordonneesLongitude' 'annonce_exclusive' 'nb_etages' 'places_parking' 'cave' 'ges_class' 'annee_construction' 'nb_toilettes' 'ascenseur' 'chauffage_energie' 'chauffage_systeme' 'chauffage_mode'... |
| n_features_in_ | 55 |
| _n_samples | 11132 |
| n_outputs_ | 1 |
| _n_samples_bootstrap | 11132 |
| estimator_ | DecisionTreeRegressor() |
| estimators_ | [DecisionTreeRegressor(max_depth=16, max_features=1.0, min_samples_leaf=2, random_state=307423091), DecisionTreeRegressor(max_depth=16, max_features=1.0, min_samples_leaf=2, random_state=1606866035), DecisionTreeRegressor(max_depth=16, max_features=1.0, min_samples_leaf=2, random_state=1621316768),... |
| Training dataset | Prediction dataset | |
|---|---|---|
| number of features | NaN | 55 |
| number of observations | NaN | 2,783 |
| missing values | NaN | 0 |
| % missing values | NaN | 0 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | 222 |
| std | 133 |
| min | 0 |
| 25% | 102 |
| 50% | 222 |
| 75% | 339 |
| max | 448 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | 0.0159 |
| std | 1.02 |
| min | -1.72 |
| 25% | -0.704 |
| 50% | -0.177 |
| 75% | 0.426 |
| max | 3.66 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.0333 |
| std | 0.997 |
| min | -0.996 |
| 25% | -0.996 |
| 50% | -0.541 |
| 75% | 0.874 |
| max | 1.62 |
| Prediction dataset | |
|---|---|
| distinct values | 5 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 5 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | 0.0223 |
| std | 0.98 |
| min | -4.93 |
| 25% | -0.375 |
| 50% | 0.122 |
| 75% | 0.677 |
| max | 1.58 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 1 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 8 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 4 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 5 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 4 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 4 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | 0.00942 |
| std | 1.02 |
| min | -1.53 |
| 25% | -0.9 |
| 50% | -0.204 |
| 75% | 0.565 |
| max | 5.58 |
| Prediction dataset | |
|---|---|
| distinct values | 7 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 5 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 1 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 8 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 8 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.0067 |
| std | 0.986 |
| min | -2 |
| 25% | -0.777 |
| 50% | -0.154 |
| 75% | 0.694 |
| max | 2.66 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.0175 |
| std | 0.995 |
| min | -2.94 |
| 25% | -0.632 |
| 50% | 0.0483 |
| 75% | 0.61 |
| max | 2.1 |
| Prediction dataset | |
|---|---|
| distinct values | 4 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.0127 |
| std | 0.986 |
| min | -2.38 |
| 25% | -0.357 |
| 50% | -0.357 |
| 75% | 0.149 |
| max | 7.24 |
| Prediction dataset | |
|---|---|
| distinct values | 8 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.00689 |
| std | 0.989 |
| min | -0.407 |
| 25% | -0.396 |
| 50% | -0.357 |
| 75% | -0.197 |
| max | 3.61 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.0115 |
| std | 0.978 |
| min | -1.27 |
| 25% | -0.791 |
| 50% | -0.131 |
| 75% | 0.55 |
| max | 6.25 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.0081 |
| std | 0.989 |
| min | -0.404 |
| 25% | -0.396 |
| 50% | -0.366 |
| 75% | -0.221 |
| max | 3.63 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.0296 |
| std | 0.941 |
| min | -1.33 |
| 25% | -0.826 |
| 50% | -0.188 |
| 75% | 0.58 |
| max | 5.48 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | 0.00293 |
| std | 1.02 |
| min | -0.608 |
| 25% | -0.503 |
| 50% | -0.387 |
| 75% | -0.134 |
| max | 4.12 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.000241 |
| std | 1.05 |
| min | -0.656 |
| 25% | -0.542 |
| 50% | -0.341 |
| 75% | 0.0772 |
| max | 8.01 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.00509 |
| std | 0.989 |
| min | -0.399 |
| 25% | -0.381 |
| 50% | -0.354 |
| 75% | -0.18 |
| max | 3.6 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | 0.0322 |
| std | 1.03 |
| min | -2.04 |
| 25% | -0.644 |
| 50% | -0.118 |
| 75% | 0.411 |
| max | 5.16 |
| Prediction dataset | |
|---|---|
| distinct values | 1 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 1 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| distinct values | 8 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | 0.00658 |
| std | 1.04 |
| min | -0.954 |
| 25% | -0.954 |
| 50% | 0.0667 |
| 75% | 0.388 |
| max | 4.2 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.00842 |
| std | 0.984 |
| min | -0.436 |
| 25% | -0.404 |
| 50% | -0.364 |
| 75% | -0.185 |
| max | 3.83 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.0106 |
| std | 1.01 |
| min | -6.72 |
| 25% | -0.729 |
| 50% | -0.145 |
| 75% | 0.605 |
| max | 3.38 |
| Prediction dataset | |
|---|---|
| distinct values | 2 |
| missing values | 0 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.0184 |
| std | 1 |
| min | -1.64 |
| 25% | -0.619 |
| 50% | -0.31 |
| 75% | 0.229 |
| max | 7.86 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | -0.0261 |
| std | 0.899 |
| min | -0.81 |
| 25% | -0.353 |
| 50% | -0.233 |
| 75% | 0.0151 |
| max | 9.52 |
| Prediction dataset | |
|---|---|
| count | 2,783 |
| mean | 2,710 |
| std | 951 |
| min | 150 |
| 25% | 2,100 |
| 50% | 2,700 |
| 75% | 3,250 |
| max | 8,190 |
Note : the explainability graphs were generated using the test set only.
| True values | Prediction values | |
|---|---|---|
| count | 2,783 | 2,783 |
| mean | 2,710 | 2,700 |
| std | 951 | 684 |
| min | 150 | 695 |
| 25% | 2,100 | 2,270 |
| 50% | 2,700 | 2,680 |
| 75% | 3,250 | 3,130 |
| max | 8,190 | 6,510 |
MAE : 421
R2 : 0.607
MSE : 355,000
MAPE : 0.189
MdAE : 309
Explained Variance : 0.607